Extraction of Reliable Transformation Parameters for Unsupervised Speaer Adaptation
نویسندگان
چکیده
Adaptation of speaker-independent hidden Markov models (HMM’s) to a new speaker using speaker-specific data is an effective approach to reinforce speech recognition performance for the enrolled speaker. Practically, it is desirable to flexibly perform the adaptation without any knowledge or limitation on the enrolled adaptation data (e.g. data transcription, length and content). However, the inevitable transcription errors on adaptation data may cause unreliability in model adaptation. The variable amount and content of adaptation data require the algorithm to dynamically control the degrees of sharing in transformation-based adaptation. This paper presents an unsupervised hierarchical adaptation algorithm where a tree structure of HMM’s is incorporated to control the transformation sharing. To extract reliable transformation parameters, we exploit the reliability assessment criteria using the confidence measure and description length. Experiments show that the unsupervised speaker adaptation with reliability assessment can significantly improve the recognition performance for any lengths of adaptation data.
منابع مشابه
Extraction of reliable transformation parameters for unsupervised speaker adaptation
Adaptation of speaker-independent hidden Markov models (HMM’s) to a new speaker using speaker-specific data is an effective approach to reinforce speech recognition performance for the enrolled speaker. Practically, it is desirable to flexibly perform the adaptation without any knowledge or limitation on the enrolled adaptation data (e.g. data transcription, length and content). However, the in...
متن کاملPrior parameter transformation for unsupervised speaker adaptation
In a strictly Bayesian approach, prior parameters are assumed known, based on common or subjective knowledge. But a practical solution for maximum a posteriori adaptation methods is to adopt an empirical Bayesian approach, where the prior parameters are estimated directly from training speech data itself. So there is a problem of mismatches between training and testing conditions in the use of ...
متن کاملDeep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning
Domain adaptation is a powerful technique given a wide amount of labeled data from similar attributes in different domains. In real-world applications, there is a huge number of data but almost more of them are unlabeled. It is effective in image classification where it is expensive and time-consuming to obtain adequate label data. We propose a novel method named DALRRL, which consists of deep ...
متن کاملOnline adaptation of HMMs to real-life conditions: a unified framework
This paper introduces a unified framework for online adaptation of hidden Markov models (HMM) parameters to real-life conditions. Hence, it aims at improving the robustness of speech recognition systems. In addition, it describes some techniques developed to control the convergence of adaptation in unsupervised modes. Classically, two approaches have been used to adapt HMM parameters to new con...
متن کاملStructural maximum a-posteriori linear regression for unsupervised speaker adaptation
In this paper we introduce an approach to transformation based model adaptation techniques. Previously published schemes like MLLR define a set of affine transformations to be applied on clusters of model parameters. Although it has been shown that this approach can yield good results when adaptation data is scarce, an inherent problem needs to be considered: the number of transformations used ...
متن کامل